Hoeffding Inequalities for Join Selectivity Estimation and Online Aggregation
نویسنده
چکیده
We extend Hoe ding s inequalities for simple averages of random variables to the case of cross product averages We also survey some new and existing Hoe ding inequalities for estimators of the mean variance and standard deviation of a subpopulation These results are applicable to two problems in object relational database management systems xed precision estimation of the selectivity of a join and online processing of aggregation queries For the rst problem the new results can be used to modify the asymptotically e cient sampling based procedures of Haas Naughton Seshadri and Swami so that there is a guaranteed upper bound on the number of sampling steps For the second problem the inequalities can be used to develop conservative con dence intervals for online aggregation such intervals avoid the large intermediate storage requirements and undercoverage problems of intervals based on large sample theory
منابع مشابه
SOLUTION-SET INVARIANT MATRICES AND VECTORS IN FUZZY RELATION INEQUALITIES BASED ON MAX-AGGREGATION FUNCTION COMPOSITION
Fuzzy relation inequalities based on max-F composition are discussed, where F is a binary aggregation on [0,1]. For a fixed fuzzy relation inequalities system $ A circ^{F}textbf{x}leqtextbf{b}$, we characterize all matrices $ A^{'} $ For which the solution set of the system $ A^{' } circ^{F}textbf{x}leqtextbf{b}$ is the same as the original solution set. Similarly, for a fixed matrix $ A $, the...
متن کاملConcentration Inequalities
1.1. Azuma-Hoeffding Inequality. Concentration inequalities are inequalities that bound probabilities of deviations by a random variable from its mean or median. Our interest will be in concentration inequalities in which the deviation probabilities decay exponentially or superexponentially in the distance from the mean. One of the most basic such inequality is the Azuma-Hoeffding inequality fo...
متن کاملMulti-way spatial join selectivity for the ring join graph
Efficient spatial query processing is very important since the applications of the spatial DBMS (e.g. GIS, CAD/CAM, LBS) handle massive amount of data and consume much time. Many spatial queries contain the multi-way spatial join due to the fact that they compute the relationships (e.g. intersect) among the spatial data. Thus, accurate estimation of the spatial join selectivity is essential to ...
متن کاملExtensions of the Hoeffding-Azuma inequalities
In this paper we give extensions of the Hoeffding-Azuma inequalities for weighted sums of uniformly bounded martingale differences. Our results improve previous results of Antonov (1979).
متن کاملMoment Inequalities for Supremum of Empirical Processes of U-Statistic Structure and Application to Density Estimation
We derive moment inequalities for the supremum of empirical processes of U-Statistic structure and give application to kernel type density estimation and estimation of the distribution function for functions of observations.
متن کامل